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DETAILED ACTION 
Response to Amendment 

1 . In response to the office action from 1 1/17/2003, the applicant has 
submitted an amendment, filed 3/2/2004, amending claims 1 and 6, while arguing 
to traverse the art rejection based on the amended limitation regarding a 
"summing the computed distance" and "each of the elements in the speech 
features and the reference template is a binary number" (Amendment page 2 and 
4). 

Applicant's arguments have been considered but are moot in view of the 
new grounds of rejection, necessitated by the amended claims, and based on 
Higgins (US Patent No. 5339385). 



Claim Rejections - 35 USC § 102 

The following is a quotation of the appropriate paragraphs of 35 
U.S.C. 102 that form the basis for the rejections under this section made in this 
Office action: 

A person shall be entitled to a patent unless -(b) the invention was patented or described in a 
printed publication in this or a foreign country or in public use or on sale in this country, more than 
one year prior to the date of application for patent in the United States. 



Claim 1 is rejected under 35 U.S.C. 102(b) as being anticipated by Higgins 
(US. Patent. No. 5,339,385). 
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1 . Regarding claim 1 , Higgins discloses a method for identifying an 
authorized user using a spectrogram includes the steps of: 

1 . Detecting an end point of a speech after a user speaks (col. 5, In. 36- 

37). 

2. Extracting speech features from a spectrogram of the speech (col. 5, In. 
13-19). 

3. Determining whether training is necessary, and, if so, taking the speech 
features as a reference template, setting a threshold (col. 6, In. 63-65) and going 
back to step 1, otherwise, proceeding to the next step (col. 5, In. 51-54). 

4. Matching patterns of the speech features and the reference template 
(col. 6, In. 20-21). 

5. Computing a distance of each element between the speech features 
and the reference template according to a matching result of step 4, and 
summing the computed distance to obtain a distance scoring (col. 6, equations 
1 a-b and 2, equation 2 teaches the average distance which is calculated by 
summing an N distance scores and then divided by N)\ wherein each of the 
elements in the speech features and the reference template is a binary number 
(col. 5, In. 4-33, the operation of the speaker verification apparatus disclosed by 
Higgins is performed in a digital system. Thus, the speech features and 
reference templates must be represented by binary numbers). 

6. Comparing the distance scoring with the threshold (col. 7, In. 63-66). 

7. Determining whether the user is authorized according to a compared 
result of step 6 (col. 7, In. 66-68). 
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Claim Rejections - 35 USC § 103 

The following is a quotation of 35 U.S.C. 103(a) which forms the basis for 

all obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described 
as set forth in section 102 of this title, if the differences between the subject matter sought to 
be patented and the prior art are such that the subject matter as a whole would have been 
obvious at the time the invention was made to a person having ordinary skill in the art to which 
said subject matter pertains. Patentability shall not be negatived by the manner in which the 
invention was made. 

Claim 3 is rejected under 35 U.S.C. 103(a) as being unpatentable over 
Higgins (US. Patent No. 5,339,385) in view of Princen (IEEE Transactions on 
Signal Processing, vol. 43, No. 11, November 1995). 



2. Regarding claim 3, Higgins discloses all the limitations of claim 3 in 
accordance with claim 1 , but fails to specifically disclose a method wherein the 
speech features are retrieved by using a Princen-Bradley filter bank to transform 
the detected speech signal to obtain a corresponding spectrogram. However, 
Princen teaches the method wherein the speech features are retrieved by using 
a Princen-Bradley filter bank to transform the detected speech signal to obtain a 
corresponding spectrogram (figures 1 and 2, filter response is shown on figures 6 
and 7). 

Since Higgins and Princen are analogous art because they are from the 
same field of endeavors, it would have been obvious to one of ordinary skill in the 
art at the time the invention was made to modify Higgins by incorporating a 
Princen-Bradley filter bank as taught by Princen in order to accurately construct 
the spectrogram of the input speech. 
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Claims 2 and 4-6 are rejected under 35 U.S.C. 103(a) as being 
unpatentable over Higgins (US. Patent No. 5,339,385) in view of Chen (US. 
Patent. No. 6,314,395). 



Regarding claim 2, Higgin discloses a method as claimed in claim 1 
wherein the detection of the end point of the speech in (i) (col. 5, In. 36-37) and 
(ii) converting analog speech signals to digital speech signals by an A/D 
converter (col. 5, In. 1-11), but fails to specifically disclose the steps of: 

(i) filtering the speech with a low-pass filter; 

(iii) pre-emphasizing the digital speech signals to thoroughly model lower- 
amplitude and higher-frequency parts of the speech; 

(iv) extracting a majority magnitude for each frame; and 

(v) comparing the majority magnitude of each frame with the threshold to 
determine a begin point and an end point of the speech. 

However, Chen teaches step (i) (10 of figure 1), step (iii) (30 of figure 1), 
step (iv) (col. 6, In. 10-15), and step (v) (col. 6, In. 19-21). The advantage of 
incorporating the steps taught by Chen into Higgins is to accurately detect the 
voice and unvoiced sections of the speech. 

Since Higgins and Chen are analogous art because they are from the 
same field of endeavors, it would have been obvious to one of ordinary skill in the 
art at the time the invention was made to modify Higgins by incorporating the 
steps taught by Chen in order to accurately detect the begin- and end-points of 
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speech words to increase reliabilities of the voice recognition system embedded 
in the user authorization system. 



3. Regarding claim 4, the combination of Higgins and Chen in accordance 
with claim 2 discloses all the limitations of claim 4, but fails to specifically disclose 
the majority magnitude is obtained by counting the total number of each absolute 
amplitude level, and the great majority of the absolute amplitude levels is defined 
as the majority magnitude of the current frame. However, Chen further teaches 
the majority magnitude is obtained by counting the total number of each absolute 
amplitude level (col. 8, In. 60-64), and the great majority of the absolute 
amplitude levels is defined as the majority magnitude of the current frame (col. 6, 
In. 48-50). The advantage of using the method of counting the majority 
magnitude, as taught by Chen, is to accurately detect the begin- and end-points 
of the speech word. 

Since Higgins and Chen are analogous art because they are from the 
same field of endeavors, it would have been obvious to one of ordinary skill in the 
art at the time the invention was made to further modify Higgins by incorporating 
the method of counting the majority magnitude as taught by Chen in order to 
accurately determine the begin- and end-points of speech words to increase 
reliabilities of the voice recognition system embedded in the user authorization 
system. 
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4. Regarding claim 5, the combination of Higgins and Chen in accordance 
with claim 2 discloses all the limitations of claim 5, but fails to specifically disclose 
the process of determining the begin point and the end point of the speech in the 
step (v) includes the steps of: 

(i) setting a threshold; 

(ii) determining whether the detection of the begin point is beginning, if yes 
going to step (iv), otherwise going to next step; 

(iii) determining whether the majority magnitudes of three adjacent frames 
are all larger than the threshold, if not, then changing the threshold and going on 
to the measurement of the next majority magnitude and going back to step (ii), 
otherwise the beginning point having been detected, going on to the 
measurement of the next majority magnitude and going back to step (ii); 

(iv) delaying a period of time; 

(v) determining whether the majority magnitudes of three adjacent frames 
are all smaller than the threshold, and, if not, going on the measurement of the 
next majority magnitude and going back to step (v), otherwise the end point has 
been detected. 

However, Chen further teaches step (i) (410 of figure 4), step (ii) (420 of 
figure 4), step (iii) (421-425 of figure 4), step (iv) (430 of figure 4), and step (v) 
(440 of figure 4). The advantage of using the steps as taught by Chen is to make 
sure that the actual begin- and end-points of a speech word are detected to 
increase reliabilities of the voice recognition unit embedded in the user 
authorization system. 
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Since Higgins and Chen are analogous art because they are from the 
same field of endeavors, it would have been obvious to one of ordinary skill in the 
art at the time the invention was made to further modify Higgins by incorporating 
the steps as taught by Chen in order to make sure that the actual begin- and 
end-points of the speech word are detected to increase reliabilities of the voice 
recognition unit embedded in the user authorization system. 



5. Regarding claim 6, Higgins discloses an apparatus for identifying an 
authorized user by using spectrograms (figure 1) comprising: 

an A/D converter for converting analog speech signals to digital speech 
signals (col. 5, In. 9-11). 

a digital signal processor for receiving digital speech signals from the A/D 
converter and performing operations in each step of the method as claimed in 
claim 1 ; (elements 5-7 of figure 1 , the combination of these elements can be 
considered as a digital signal processor) and 

a memory device for storing data of a threshold and a reference template 
which are required in the operations of the digital signal processor, wherein each 
element of the reference template is a binary number (col. 7, In. 33-40, the 
operation of the speaker verification apparatus disclosed by Higgins is performed 
in a digital system. Thus, the speech features and reference templates must be 
represented by binary numbers). 

Higgins fails to specifically disclose a low-pass filter for limiting the 
frequency range of submitted speech. However, Chen teaches a low-pass filter 
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for limiting the frequency range of submitted speech (10 of figure 1). The 
advantage of using a low-pass filter as taught by Chen is to eliminate the 
unwanted high frequency noise. 

Since Higgins and Chen are analogous art because they are from the 
same field of endeavors, it would have been obvious to one of ordinary skill in the 
art at the time the invention was made to further modify Higgins by incorporating 
a low-pass filter as taught by Chen in order to remove unwanted high frequency 
noise from the signal. 

Conclusion 

Applicant's amendment necessitated the new ground(s) of rejection 
presented in this Office action. Accordingly, THIS ACTION IS MADE FINAL. 
See MPEP § 706.07(a). Applicant is reminded of the extension of time policy as 
set forth in 37 CFR 1.136(a). 

A shortened statutory period for reply to this final action is set to expire 
THREE MONTHS from the mailing date of this action. In the event a first reply is 
filed within TWO MONTHS of the mailing date of this final action and the advisory 
action is not mailed until after the end of the THREE-MONTH shortened statutory 
period, then the shortened statutory period will expire on the date the advisory 
action is mailed, and any extension fee pursuant to 37 CFR 1.136(a) will be 
calculated from the mailing date of the advisory action. In no event, however, will 
the statutory period for reply expire later than SIX MONTHS from the date of this 
final action. 
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The prior art made of record and not relied upon is considered pertinent to 
applicants disclosure. Honda (US. Patent No. 5,293,448) teaches a speech 
analysis method for detecting voice and un-voice speech signal that is 
considered pertinent to the claimed invention. 

Any inquiry concerning this communication or earlier communications from 
the examiner should be directed to Huyen Vo whose email address is 
huven.vo@uspto.gov . The examiner can normally be reached on M-F, 9-5:30. 

If attempts to reach the examiner by telephone are unsuccessful, the 
examiner's supervisor, Doris To can be reached on 703-305-4827. The fax 
phone number for the organization where this application or proceeding is 
assigned is 703-306-0377. 

Any inquiry of a general nature or relating to the status of this application 
or proceeding should be directed to the receptionist whose telephone number is 
703-306-0377. 



Examiner: 




April 30, 2004 




DORIS H.T0 
SUPERVISORY PATENT EXAMINER 
TECHNOLOGY CENTER 2600 



